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Queries: Predicting query performance 

Steve Cronen-Townsend, Yun Zhou, W. Bruce Croft 

August 2002 Proceedings of the 25th annual international ACM SIGIR conference on 

Research and development in information retrieval 

„ . . .. . . si, „ .x/oco -fa Additional Information: full citation , abstract , references , citings , index 

Full text available: pdf(258.74 KB) terms 

We develop a method for predicting query performance by computing the relative entropy 
between a query language model and the corresponding collection language model. The 
resulting clarity score measures the coherence of the language usage in documents whose 
models are likely to generate the query. We suggest that clarity scores measure the 
ambiguity of a query with respect to a collection of documents and show that they correlate 
positively with average precision in a variety of TREC tes ... 



Keywords: ambiguity, clarity, information theory, language models 



Poster papers - short papers: The link prediction problem for social networks 
David Liben-Nowell, Jon Kleinberg 

November 2003 Proceedings of the twelfth international conference on Information and 
knowledge management 

Full text available: ^ pdf(85.72 KB) Additional Information: full citation , abstract , references , index terms 

Given a snapshot of a social network, can we infer which new interactions among its 
members are likely to occur in the near future? We formalize this question as the link 
prediction problem, and develop approaches to link prediction based on measures the 
"proximity" of nodes in a network. Experiments on large co-authorship networks suggest 
that information about future interactions can be extracted from network topology alone, 
and that fairly subtle measures for detecting node proximity c ... 

Keywords: link analysis, link prediction, social networks 



Predicting the performance of linearly combined IR systems 
Christopher C. Vogt, Garrison W. Cottrell 

August 1998 Proceedings of the 21st annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available: ^ pdf(896.96 KB) Additional Information: full citation , references , citings , index terms 
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An efficient boosting algorithm for combinin g preferences 
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Yoav Freund, Raj Iyer, Robert E. Schapire, Yoram Singer 

December 2003 The Journal of Machine Learning Research, volume 4 

Full text available: 1 ^ pdf(392.20 KB) Additional Information: full citation , abstract , index terms 

We study the problem of learning to accurately rank a set of objects by combining a given 
collection of ranking or preference functions. This problem of combining preferences arises in 
several applications, such as that of combining the results of different search engines, or the 
"collaborative-filtering" problem of ranking movies for a user based on the movie rankings 
provided by other users. In this work, we begin by presenting a formal framework for this 
general problem. We then describe and ... 



5 Tree induction vs. logistic regression: a learning-curve analysis 
Claudia Perlich, Foster Provost, Jeffrey S. Simonoff 
December 2003 The Journal of Machine Learning Research, Volume 4 

Full text available: ^ pdf(263.37 KB) Additional Information: full citation , abstract , citings , index terms 

Tree induction and logistic regression are two standard, off-the-shelf methods for building 
models for classification. We present a large-scale experimental comparison of logistic 
regression and tree induction, assessing classification accuracy and the quality of rankings 
based on class-membership probabilities. We use a learning-curve analysis to examine the 
relationship of these measures to the size of the training set. The results of the study show 
several things. (1) Contrary to some prior o ... 




6 Technique for automatically correcting words in text 
Karen Kukich 

December 1992 ACM Computing Surveys (CSUR), Volume 24 issue 4 

Full text available* S pdf(6.23 MB) Additional Information: full citation , abstract , references , citings , index 

" terms , review 

Research aimed at correcting words in text has focused on three progressively more difficult 
problems:(l) nonword error detection; (2) isolated-word error correction; and (3) context- 
dependent work correction. In response to the first problem, efficient pattern-matching and 
n-gram analysis techniques have been developed for detecting strings that do not appear in 
a given word list. In response to the second problem, a variety of general and application- 
specific spelling cor ... 

Keywords: n-gram analysis, Optical Character Recognition (OCR), context-dependent 
spelling correction, grammar checking, natural-language-processing models, neural net 
classifiers, spell checking, spelling error detection, spelling error patterns, statistical- 
language models, word recognition and correction 



7 Evaluating collaborative filtering recommender systems 

Jonathan L. Herlocker, Joseph A. Konstan, Loren G. Terveen, John T. Riedl 
January 2004 ACM Transactions on Information Systems (TOIS), Volume 22 issue l 

Full text available: ^ pdf(253.92 KB) Additional Information: full citation , abstract , references , index terms 

Recommender systems have been evaluated in many, often incomparable, ways. In this 
article, we review the key decisions in evaluating collaborative filtering recommender 
systems: the user tasks being evaluated, the types of analysis and datasets being used, the 
ways in which prediction quality is measured, the evaluation of prediction attributes other 
than quality, and the user-based evaluation of the system as a whole. In addition to 
reviewing the evaluation strategies used by prior researchers ... 

Keywords: Collaborative filtering, evaluation, metrics, recommender systems 
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Modeling transcription programs: inferring binding site activity and dose-response 
model optimization 
Amos Tanay, Ron Shamir 
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April 2003 Proceedings of the seventh annual international conference on 
Computational molecular biology 

Full text available: ^ pdf(277.65 KB) Additional Information: full citation , abstract, references , index terms 

The modeling of transcription regulation programs is a major focus of today's biology. The 
challenge is to utilize diverse high-throughput data (gene expression, promoter binding site 
localization assays, protein expression) in order to infer the mechanistic models of 
transcription control. We propose a new model which integrates transcription factor-gene 
affinities, protein abundance and gene expression levels. Transcription factor binding site 
activity is represented by a dose-affinity-respons ... 

9 Modeling user behavior: Predicting human interruptibility with sensors: a Wizard of Oz Qjj 
feasibility study 

Scott Hudson, James Fogarty, Christopher Atkeson, Daniel Avrahami, Jodi Forlizzi, Sara 
Kiesler, Johnny Lee, Jie Yang 

April 2003 Proceedings of the conference on Human factors in computing systems 

Full text available - fi?) pdf(727 56 KB) Additional Information: full citation , abstract , references , citings , index 
• y^-H— = terms 

A person seeking someone else's attention is normally able to quickly assess how 
interruptible they are. This assessment allows for behavior we perceive as natural, socially 
appropriate, or simply polite. On the other hand, today's computer systems are almost 
entirely oblivious to the human world they operate in, and typically have no way to take into 
account the interruptibility of the user. This paper presents a Wizard of Oz study exploring 
whether, and how, robust sensor-based predictions of ... 

Keywords: context-aware computing, machine learning, sensor-based interfaces, 
situationally appropriate interaction 



10 Special issue on using large corpora: II: Coping with ambiguity and unknown words 
through probabilistic models 

Ralph Weischedel, Richard Schwartz, Jeff Palmucci, Marie Meteer, Lance Ramshaw 
June 1993 Computational Linguistics, volume 19 issue 2 

Full text available:^ |f| 

H|pdf(l.59 MB) w Additional Information: full citation , abstract , references 
Publisher Site 

From spring 1990 through fall 1991, we performed a battery of small experiments to test the 
effectiveness of supplementing knowledge-based techniques with probabilistic models. This 
paper reports our experiments in predicting parts of speech of highly ambiguous words, 
predicting the intended interpretation of an utterance when more than one interpretation 
satisfies all known syntactic and semantic constraints, and learning caseframe informationfor 
verbsfrom example uses. From these experiments, w ... 

11 A study of smoothing methods for language models applied to information retrieval 
Chengxiang Zhai, John Lafferty 

April 2004 ACM Transactions on Information Systems (TOIS), Volume 22 issue 2 

Full text available: ^ pdf(296.22 KB) Additional Information: full citation , abstract , references , index terms 

Language modeling approaches to information retrieval are attractive and promising because 
they connect the problem of retrieval with that of language model estimation, which has 
been studied extensively in other application areas such as speech recognition. The basic 
idea of these approaches is to estimate a language model for each document, and to then 
rank documents by the likelihood of the query according to the estimated language model. A 
central issue in language model estimation is smo ... 

Keywords: Dirichlet prior smoothing, EM algorithm, Jelinek— Mercer smoothing, Statistical 
language models, TF-IDF weighting, absolute discounting smoothing, backoff smoothing, 
interpolation smoothing, leave-one-out, risk minimization, term weighting, two-stage 
smoothing 
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12 Comprehensive statistical method for protein fold recognition 

Jadwiga R. Bierikowska, Lihua Yu, Sophia Zarakhovich, Robert G. Rogers, Temple F. Smith 
April 2000 Proceedings of the fourth annual international conference on 
Computational molecular biology 

Full text available: * g|pdf(1.08 MB) Additional Information: full citation , abstract , references 

We present a protein fold recognition method that uses a comprehensive statistical 
interpretation of structural Hidden Markov Models (HMMs). The structure/fold recognition is 
done by summing the probabilities of all sequence-to-structure alignments Conventionally, 
Boltzmann statistics dictate that the optimal alignment can give an estimate of the lowest 
free energy of the sequence conformation imposed by the structural model. The alignment is 
optimized for a scoring function that is interpret ... 

13 Im proving statistical language model performance with automatically generated word 
hierarchies 

John G. McMahon, Francis J. Smith 

June 1996 Computational Linguistics, volume 22 issue 2 

Full text available: 



^|pdf(2.02 MB) Wp Additional Information: full citation , abstract , references , citings 
Publisher Site 

An automatic word-classification system has been designed that uses word unigram and 
bigram frequency statistics to implement a binary top-down form of word clustering and 
employs an average class mutual information metric. Words are represented as structural 
tags— -n-bit numbers the most significant bit-patterns of which incorporate class information. 
The classification system has revealed some of the lexical structure of English, as well as 
some phonemic and semantic structure. The syst ... 

14 Challenges in information retrieval and language modeling: report of a workshop held 
at the center for intelligent information retrieval, University of Massachusetts Amherst, 
September 2002 

James Allan, Jay Aslam, Nicholas Belkin, Chris Buckley, Jamie Callan, Bruce Croft, Sue Dumais, 
Norbert Fuhr, Donna Harman, David J. Harper, Djoerd Hiemstra, Thomas Hofmann, Eduard 
Hovy, Wessel Kraaij, John Lafferty, Victor Lavrenko, David Lewis, Liz Liddy, R. Manmatha, 
Andrew McCallum, Jay Ponte, John Prager, Dragomir Radev, Philip Resnik, Stephen Robertson, 
Roni Rosenfeld, Salim Roukos, Mark Sanderson, Rich Schwartz, Amit Singhal, Alan Smeaton, 
Howard Turtle, Ellen Voorhees, Ralph Weischedel, Jinxi Xu, ChengXiang Zhai 
April 2003 ACM SIGIR Forum, Volume 37 Issue 1 

Full text available: ||| pdf(1.60 MB) Additional Information: full citation 



15 Automatic combination of multiple ranked retrieval systems 
Brian T. Bartell, Garrison W. Cottrell, Richard K. Belew 

August 1994 Proceedings of the 17th annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available: ^ pdf(834.64 KB) Additional Information: full citation , references , citings , index terms 



16 Music information retrieval: Harmonic models for polyphonic music retrieval 
Jeremy Pickens, Tim Crawford 

November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 

Full text available: ^pdf(155.14 KB) Additional Information: full citation , abstract , references , citings 

Most work in the ad hoc music retrieval field has focused on the retrieval of monophonic 
documents using monophonic queries. Polyphony adds considerably more complexity. We 
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present a method by which polyphonic music documents may be retrieved by polyphonic 
music queries. A new harmonic description technique is given, wherein the information from 
all chords, rather than the most significant chord, is used. This description is then combined 
in a new and unique way with Markov statistical methods t ... 

17 Dependency networks for inference, collaborative filtering, and data visualization j| 
David Heckerman, David Maxwell Chickering, Christopher Meek, Robert Rounthwaite, Carl 

Kadie 

September 2001 The Journal of Machine Learning Research, volume l 
Full text available: ^ pdf(337.07 KB) Additional Information: full citation , abstract 

We describe a graphical model for probabilistic relationships— an alternative to the Bayesian 
network— called a dependency network. The graph of a dependency network, unlike a 
Bayesian network, is potentially cyclic. The probability component of a dependency network, 
like a Bayesian network, is a set of conditional distributions, one for each node given its 
parents. We identify several basic properties of this representation and describe a 
computationally efficient procedure for learning the gra ... 

18 Research track: Nantonac collaborative filtering: recommendation based on order Q 

responses 
Toshihiro Kamishima 

August 2003 Proceedings of the ninth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available: ^ pdfd 34.24 KB) Additional Information: full citation, abstract , references , ind ex terms 

A recommender system suggests the items expected to be preferred by the users. 
Recommender systems use collaborative filtering to recommend items by summarizing the 
preferences of people who have tendencies similar to the user preference. Traditionally, the 
degree of preference is represented by a scale, for example, one that ranges from one to 
five. This type of measuring technique is called the semantic differential (SD) method. Web 
adopted the ranking method, however, rather than the SD metho ... 

Keywords: collaborative filtering, order, recommender system 



19 Accurate static estimators for program optimization 

Tim A. Wagner, Vance Maverick, Susan L. Graham, Michael A. Harrison 

June 1994 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 1994 conference 

on Programming language design and implementation, Volume 29 issue 6 

i- mx * i ui 0 Jf/H ..m Additional Information: full citation , abstract , references , citings , index 

Full text available: TO pdf(1.Q4 MB ) 

^ terms 

Determining the relative execution frequency of program regions is essential for many 
important optimization techniques, including register allocation, function inlining, and 
instruction scheduling. Estimates derived from profiling with sample inputs are generally 
regarded as the most accurate source of this information; static (compile-time) estimates 
are considered to be distinctly inferior. If static estimates were shown to be competitive, 
however, their convenience would outweigh minor ... 

20 Searching the Web 

August 2001 ACM Transactions on Internet Technology (TOIT), volume l issue l 

^ ii i i .. . , a .r /riJtn no is rj\ Additional Information: full citation , abstract , references , citings , index 

Full text available: THpdf(319.98 KB) s -' 

terms , review 

We offer an overview of current Web search engine design. After introducing a generic 
search engine architecture, we examine each engine component in turn. We cover crawling, 
local Web page storage, indexing, and the use of link analysis for boosting search 
performance. The most common design and implementation techniques for each of these 
components are presented. For this presentation we draw from the literature and from our 
own experimental search engine testbed. Emphasis is on introduci ... 
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Keywords: HITS, PageRank, authorities, crawling, indexing, information retrieval, link 
analysis, search engine 
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1 Tree induction vs. logistic regression: a learning-curve analysis 
Claudia Perlich, Foster Provost, Jeffrey S. Simonoff 
December 2003 The Journal of Machine Learning Research, Volume 4 

Full text available: pdf(263.37 KB) Additional Information: full citation , abstract , citings , index terms 

Tree induction and logistic regression are two standard, off-the-shelf methods for building 
models for classification. We present a large-scale experimental comparison of logistic 
regression and tree induction, assessing classification accuracy and the quality of rankings 
based on class-membership probabilities. We use a learning-curve analysis to examine the 
relationship of these measures to the size of the training set. The results of the study show 
several things. (1) Contrary to some prior o ... 



2 Technique for automatically correcting words in text 
Karen Kukich 

December 1992 ACM Computing Surveys (CSUR), Volume 24 issue 4 

Additional Information: full citation , abstract , references , citings , index 
terms , review 



Full text available: pdf(6.23 MB) 



Research aimed at correcting words in text has focused on three progressively more difficult 
problems:(l) nonword error detection; (2) isolated-word error correction; and (3) context- 
dependent work correction. In response to the first problem, efficient pattern-matching and 
n-gram analysis techniques have been developed for detecting strings that do not appear in 
a given word list. In response to the second problem, a variety of general and application- 
specific spelling cor ... 

Keywords: n-gram analysis, Optical Character Recognition (OCR), context-dependent 
spelling correction, grammar checking, natural-language-processing models, neural net 
classifiers, spell checking, spelling error detection, spelling error patterns, statistical- 
language models, word recognition and correction 



Modeling user behavior: Pre dicting human interruptibility with sensors: a Wizard of Oz 
feasibility study 

Scott Hudson, James Fogarty, Christopher Atkeson, Daniel Avrahami, Jodi Forlizzi, Sara 
Kiesler, Johnny Lee, Jie Yang 

April 2003 Proceedings of the conference on Human factors in computing systems 

Full text available- * g| pdf(727 56 KB) Additional Information: full citation , abstract , references, citings , index 

A person seeking someone else's attention is normally able to quickly assess how 
interruptible they are. This assessment allows for behavior we perceive as natural, socially 
appropriate, or simply polite. On the other hand, today's computer systems are almost 
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entirely oblivious to the human world they operate in, and typically have no way to take into 
account the interruptibility of the user. This paper presents a Wizard of Oz study exploring 
whether, and how, robust sensor-based predictions of ... 

Keywords: context-aware computing, machine learning, sensor-based interfaces, 
situationally appropriate interaction 
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4 Special issue on using large corpora: II: Coping with ambiguity and unknown words 
through probabilistic models 

Ralph Weischedel, Richard Schwartz, Jeff Palmucci, Marie Meteer, Lance Ramshaw 
June 1993 Computational Linguistics, Volume 19 issue 2 

Full text available: ^ ... . . [fjjl 

H|pdf(1.59 MBjVO 1 Additional Information: full citation , abstract , references 

Publisher Site 

From spring 1990 through fall 1991, we performed a battery of small experiments to test the 
effectiveness of supplementing knowledge-based techniques with probabilistic models. This 
paper reports our experiments in predicting parts of speech of highly ambiguous words, 
predicting the intended interpretation of an utterance when more than one interpretation 
satisfies all known syntactic and semantic constraints, and learning caseframe informationfor 
verbsfrom example uses. From these experiments, w ... 

5 Dependency networks for inference, collaborative filterin g , and data visualization 
David Heckerman, David Maxwell Chickering, Christopher Meek, Robert Rounthwaite, Carl 
Kadie 

September 2001 The Journal of Machine Learning Research, Volume l 
Full text available: ^ pdf(337.07 KB) Additional Information: full citation , abstract 

We describe a graphical model for probabilistic relationships—an alternative to the Bayesian 
network—called a dependency network. The graph of a dependency network, unlike a 
Bayesian network, is potentially cyclic. The probability component of a dependency network, 
like a Bayesian network, is a set of conditional distributions, one for each node given its 
parents. We identify several basic properties of this representation and describe a 
computationally efficient procedure for learning the gra ... 

6 Improving statistical language model performance with automatically generated word 
hierarchies 

John G. McMahon, Francis J. Smith 

June 1996 Computational Linguistics, Volume 22 issue 2 

Full text available* 



^ pdf(2.02 MB) W> Additional Information: full citation , abstract , references , citings 
Publisher Site 

An automatic word-classification system has been designed that uses word unigram and 
bigram frequency statistics to implement a binary top-down form of word clustering and 
employs an average class mutual information metric. Words are represented as structural 
tags— n-bit numbers the most significant bit-patterns of which incorporate class information. 
The classification system has revealed some of the lexical structure of English, as well as 
some phonemic and semantic structure. The syst ... 

Recommender systems using linear classifiers 
Tong Zhang, Vijay S. Iyengar 

March 2002 The Journal of Machine Learning Research, volume 2 

Additional Information: full citation , abstract , references , citings , index 



Full text available: 

terms 

Recommender systems use historical data on user preferences and other available data on 
users (for example, demographics) and items (for example, taxonomy) to predict items a 
new user might like. Applications of these methods include recommending items for 
purchase and personalizing the browsing experience on a web-site. Collaborative filtering 
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methods have focused on using just the history of user preferences to make the 
recommendations. These methods have been categorized as memory-based 
Keywords: collaborative filtering, decision trees, linear models, recommender systems, 
unbalanced data 



Classification and regression: money *can* grow on trees 
Johannes Gehrke, Wie-Yin Loh, Raghu Ramakrishnan 

August 1999 Tutorial notes of the fifth ACM SIGKDD international conference on 

Knowledge discovery and data mining 

ii i l -, . , ci, M/n nc - Rflm Additional Information: full citation , abstract , references , citings, index 

Full text available: TO pdf(2.95 MB) ^~ 

terms 

With over 800 million pages covering most areas of human endeavor, the World-wide Web is 
a fertile ground for data mining research to make a difference to the effectiveness of 
information search. Today, Web surfers access the Web through two dominant interfaces 
clicking on hyperlinks and searching via keyword queries This process is often tentative and 
unsatisfactory Better support is needed for expressing one's information need and dealing 
with a search result in more structured ways than ... 



9 Information extraction: Is question answering an acquired skill? 

Ganesh Ramakrishnan, Soumen Chakrabarti, Deepa Paranjpe, Pushpak Bhattacharya 
May 2004 Proceedings of the 13th conference on World Wide Web 

Full text available: ^ pdf(260.13 KB) Additional Information: full citation , abstract , references, index te r m s 

We present a question answering (QA) system which learns how to detect and rank answer 
passages by analyzing questions and their answers (QA pairs) provided as training data. We 
built our system in only a few person-months using off-the-shelf components: a part-of- 
speech tagger, a shallow parser, a lexical network, and a few well-known supervised learning 
algorithms. In contrast, many of the top TREC QA systems are large group efforts, using 
customized ontologies, question classifiers, and highl ... 




Keywords: machine learning, question answering 



10 Learning and making decisions when costs and probabilities are both unknown 
Bianca Zadrozny, Charles Elkan 

August 2001 Proceedings of the seventh ACM SIGKDD international conference on 

Knowledge discovery and data mining 

i- M * * , u 0 . f/Aon Additional Information: full citation , abstract , references , citings , index 

Full text available: TO pdf(920.23 KB) * 

terms 

In many data mining domains, misclassification costs are different for different examples, in 
the same way that class membership probabilities are example-dependent. In these 
domains, both costs and probabilities are unknown for test examples, so both cost 
estimators and probability estimators must be learned. After discussing how to make optimal 
decisions given cost and probability estimates, we present decision tree and naive Bayesian 
learning methods for obtaining well-calibrated probability ... 

11 Automated assistants to aid humans in understanding team behaviors 
Taylor Raines, Milind Tambe, Stacy Marsella 

June 2000 Proceedings of the fourth international conference on Autonomous agents 

Full text available: ^|| pdf(1.09 MB) Additional Information: full citation , references , index terms 



12 Accurate static estimators for program optimization 

Tim A. Wagner, Vance Maverick, Susan L. Graham, Michael A. Harrison 

June 1994 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 1994 conference 
on Programming language design and implementation, volume 29 issue 6 
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Full text available: *g| pdf(1.04 MB) Additional Information: full citation , abstract , references , citings, index 

terms 

Determining the relative execution frequency of program regions is essential for many 
important optimization techniques, including register allocation, function inlining, and 
instruction scheduling. Estimates derived from profiling with sample inputs are generally 
regarded as the most accurate source of this information; static (compile-time) estimates 
are considered to be distinctly inferior. If static estimates were shown to be competitive, 
however, their convenience would outweigh minor ... 

1 3 Magical thinkin g in data mining: lessons from ColL challenge 2000 21 
Charles Elkan 

August 2001 Proceedings of the seventh ACM SIGKDD international conference on 

Knowledge discovery and data mining 

r- .. * * ■, u 0 cc Additional Information: full citation , abstract , references , citings , index 

Full text available: 1m pdf(602.56 KB) — 

terms 

ColL challenge 2000 was a supervised learning contest that attracted 43 entries. The authors 
of 29 entries later wrote explanations of their work. This paper discusses these reports and 
reaches three main conclusions. First, naive Bayesian classifiers remain competitive in 
practice: they were used by both the winning entry and the next best entry. Second, 
identifying feature interactions correctly is important for maximizing predictive accuracy: this 
was the difference between the winning classi ... 

14 Improving accuracy in word class tagging through the combination of machine learning Q 
systems 

Hans van Halteren, Walter Daelemans, Jakub Zavrel 
June 2001 Computational Linguistics, volume 27 issue 2 

Full text available:^ t „ n _ fc ||| 



pdf(2 i 37_MB)_WII! Additional Information: full citat i on , abstract, reference s 

Publisher Site 

We examine how differences in language models, learned by different data-driven systems 
performing the same NLP task, can be exploited to yield a higher accuracy than the best 
individual system. We do this by means of experiments involving the task of 
morphosyntactic word class tagging, on the basis of three different tagged corpora. Four 
well-known tagger generators (hidden Markov model, memory-based, transformation rules, 
and maximum entropy) are trained on the same corpus data. After comparis ... 

15 Automatic combination of multiple ranked retrieval systems 
Brian T. Bartell, Garrison W. Cottrell, Richard K. Belew 

August 1994 Proceedings of the 17th annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available:^ pdf(834.64 KB) Additional Information: full citation , references , citings, index terms 



1 6 Computational strategies for object recognition 
Paul Suetens, Pascal Fua, Andrew J. Hanson 
March 1992 ACM Computing Surveys (CSUR), Volume 24 Issue l 

r- ii * ^ 0 , f/c: 0 -, RJ . m Additional Information: full citation , a bstract , references , citings, index 

Full text available: TO pdf(6.37 MB) x * 

terms , review 

This article reviews the available methods for automated identification of objects in digital 
images. The techniques are classified into groups according to the nature of the 
computational strategy used. Four classes are proposed: (1) the simplest strategies, which 
work on data appropriate for feature vector classification, (2) methods that match models to 
symbolic data structures for situations involving reliable data and complex models, (3) 
approaches that fit models to the photometry and ... 

Keywords: image understanding, model-based vision, object recognition 
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17 Stable algorithms for link analysis 
Andrew Y. Ng, Alice X. Zheng, Michael I. Jordan 

September 2001 Proceedings of the 24th annual international ACM SIGIR conference on 

Research and development in information retrieval 

— ii , . u. « ^/o^o nA Additional Information: full citation , abstract , references , citings , index 
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The Kleinberg HITS and the Google PageRank algorithms are eigenvector methods for 
identifying * A authoritative" or 1 influential" articles, given hyperlink or citation information. 
That such algorithms should give reliable or consistent answers is surely a desideratum, and 
in~\cite{ijcaiPaper}, we analyzed when they can be expected to give stable rankings under 
small perturbations to the linkage patterns. In this paper, we extend the analysis and show 
how it gives insight into ways of de ... 
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Parallel texts (bitexts) have properties that distinguish them from other kinds of parallel 
data. First, most words translate to only one other word. Second, bitext correspondence is 
typically only partial— many words in each text have no clear equivalent in the other text. 
This article presents methods for biasing statistical translation models to reflect these 
properties. Evaluation with respect to independent human judgments has confirmed that 
translation models biased in this fashion are si ... 
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In many applications, users specify target values for certain attributes, without requiring 
exact matches to these values in return. Instead, the result to such queries is typically a 
rank of the "top k" tuples that best match the given attribute values. In this paper, we study 
the advantages and limitations of processing a top-k query by translating it into a single 
range query that a traditional relational database management system (RDBMS) can process 
efficiently. In particular, ... 
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We offer an overview of current Web search engine design. After introducing a generic 
search engine architecture, we examine each engine component in turn. We cover crawling, 
local Web page storage, indexing, and the use of link analysis for boosting search 
performance. The most common design and implementation techniques for each of these 
components are presented. For this presentation we draw from the literature and from our 
own experimental search engine testbed. Emphasis is on introduci ... 

Keywords: HITS, PageRank, authorities, crawling, indexing, information retrieval, link 
analysis, search engine 
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